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1.  INTRODUCTION 


The  goal  in  maiQr  activities  or  contests  is  not  necessarily  to  do 
well  in  any  absolute  sense,  but  merely  to  out  perform  an  adversary. 

The  objective  of  this  paper  is  to  provide  a  model  for  such  a  contest, 
establish  the  optimality  of  certain  procedures,  and  provide  sui table 
approximations  to  these  optimal  procedures.  But  before  yielding  to 
the  Mathematics  of  the  model,  we  wish  to  fix  ideas  with  an  anecdote. 

Two  statisticians.  Bob  and  Mike,  engage  in  a  contest  to  guess 
weights  of  people  at  a  party.  They  agree  that  Bob  will  always  guess 
first.  Mike  will  then  guess  and  finally  the  person  in  question  will 
say  who  is  closer.  For  example  on  person  number  one  Bob  guessed  137 
pounds.  Mike  then  guesses  137.01  pounds  and  the  guest  declared  Mike 
the  victor.  The  contest  continued  in  a  similar  vein  and  to  Bob's 
dismay  he  won  barely  a  quarter  of  the  time. 

Intuitively  it  is  clear  that  the  second  guess er  has  an  advantage, 
and  one  of  the  results  of  Section  2  shows  that  this  advantage  is  typically 
as  large  as  the  75  percent  obtained  by  Mike  in  the  anecdote. 

To  continue  the  story.  Bob  was  so  stunned  by  defeat  and  eager  for 
revenge,  that  he  elicited  the  assistance  of  a  professional  weight  guesser. 
Mike  agreed  that  since  the  new  team  was  so  powerful  it  should  be  willing 
to  make  all  of  its  guesses  about  the  weights  of  the  guests  before  Mike 
had  to  state  any  of  his  guesses.  The  team  agreed  to  the  proposed  rule 
change,  and  Mike  then  proceeded  to  win  even  more  convincingly  than  before. 
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The  strategy  used  by  Mike  in  the  second  case  is  naturally  more 
sophisticated  than  the  one  he  used  when  he  was  matched  against  an 
equal.  This  second  strategy  derives  from  an  hierarchical  linear 
model  like  that  studied  in  Lindley  and  Smith  [8].  It  is  also  closely 
connected  with  the  James -Stein  estimator  and  was  originally  motivated 
by  the  "Batting  Average"  example  of  Efron  and  Morris  [3]. 

Our  program  begins  by  establishing  in  Section  2  a  formal  theory 
of  guessing  contests.  We  also  give  a  simple  but  very  general 
optimality  result  which  forms  the  basis  for  the  rest  of  the  paper. 

The  third  section  determines  the  exact  optimal  strategy  for  second 
guessing  under  a  certain  linear  model.  Practical  approximations  to  this 
optimal  strategy  are  worked  out  in  Section  4  .  The  final  section  gives 
a  critical  discussion  of  the  various  sources  of  difficulties  inherent 
in  applying  this  theory  of  guessing  contests.  While  the  main  point  of 
this  paper  is  to  provide  a  tractible  theory  of  guessing  contests,  we 
feel  that  the  largest  single  point  established  is  the  approximate 
optimality  of  the  simple  rule  given  by  Equation  (4.1 ). 


2k  HOTELLING'S  STRATEGY 


'Hie  structure  of  our  guessing  model  can  be  described  by 
a  system  of  four  p  vectors. 

Target  Values:  (®1,62> *  * ’ = 

Firs  t  Gue s  s :  (X^  *  ^2  *  *  *  *  j  )  =  I 

Second  Guesser' s  Hunch:  (Y. ,Y0, . . . ,Y  )  =  Y 

id  p 

Second  Guess:  (G^,G2> . . . >Gp)  =  G 

The  ©i  represent  the  real  values  to  be  guessed.  The  are 
guesses  made  by  the  person  who  goes  first  and  all  of  these  are  assumed 
to  be  available  to  the  second  guesser  before  he  acts.  The  Y^ 
represent  the  second  guesser' s  best  estimate  of  the  0^.  Finally, 
the  G^  are  the  guesses  to  be  announced  by  the  second  guesser. 

Our  principal  task  is  to  determine  how  G  should  be  based  on  X 
and  Y. 

The  objective  of  each  player  is  to  come  closer  to  0  than  his 
opponent,  so  we  begin  by  setting 


(2.1) 

where 


The  strategic  objective  of  the  second  guesser  is  therefore  to  maximize 


E  V(G,f),  i.e.,  the  second  guesser  wishes  to  maximize  the  expected 
number  of  times  his  guesses  come  closer  to  the  true  values. 

The  only  probabalistic  assumptions  to  be  made  now  are  that  (■,  X,  Y 
have  a  joint  distribution  which  is  continuous.  This  assumption  is  made 
for  convenience  and  avoids  the  ad  hoc  conventions  required  for  dealing 
with  ties. 

Now  let  v^(x,Y)  denote  the  median  of  the  conditional  distribution 
of  ^  given  X  and  Y.  A  key  role  in  our  guessing  theory  is  played 

if  Xi<vi(X,Y) 

otherwise 

These  strategies  will  subsequently  be  called  Hotelling  Strategies 
since  they  were  essentially  put  forward  in  Hotelling  [6,  p.  51 1*  There 
are  broad  differences  between  the  present  model  and  Hotelling's  problem 
in  location  economics,  but  the  relationship  seems  close  enough  to 
justify  (or  even  require)  the  name.  The  main  fact  in  this  section  is 
the  following  simple  result: 

Theorem  2.1.  The  Hotelling  Strategies  are  c -optimal,  that  is, 

lim  E  V(G€  ,0 )  -  sup  E  V(G,t>)  . 

*  ♦  o  G 

Proof.  Since  any  guess  must  be  on  one  side  or  the  other  of 

we  have 

U 


by  the  following  strategy: 


vc 


V€ 


P5C,Y(!Gi-01l  <  IXj-flJ)  <na*{P>C,Y(0i  <  X^,  ^ (0±  >  Xj)} 


The  basic  observation  about  G^  is  that 

11m  PX,Y(|G^-0il  <kl-e1l)  =  ma*{PX,Y(0i  <  X±),  PX,Y(01  >XA)}  . 

Taking  expectations  in  the  two  preceeding  relations  and  suraaing  over 
1  <  i  <  p,  the  theorem  is  proved. 

A  compelling  impediment  to  the  use  of  Hotelling  strategies  is  that 
they  require  the  knowledge  of  the  joint  distribution  of  0,  X*  Y,  or  at 
least  the  knowledge  of  v^X^Y).  The  key  task  of  the  remainder  of  this 
paper  is  to  isolate  some  feasible  circumstances  where  this  impediment 
can  be  overcome. 

To  begin,  consider  the  strategies 


Xi+€ 

if 

Xi 

<Yi 

Xj^-e 

if 

Xi 

>Yi 

where  the  second  guesser  places  his  guess  just  a  bit  to  the  side  of  the 
first  guess  in  the  direction  of  his  own  "hunch"  Y^. 

In  some  cases  one  can  show  that  these  Hunch  Guided  Guesses  are 
in  fact  Hotelling  strategies.  Certainly,  if  the  vectors  (e^>Y^,X^), 

1  <  i  <  p  are  independent  and  P^Iy^  -  NCY^Oy),  Xi|Pi,Yi  --  X^^ I ©i  ~  N(o1,o^) 
then  Vj  (X,Y )  is  on  the  same  side  of  X^  as  Y^.  This  immediately 
impl  ies  that  the  Hotelling  and  Hunch  Guided  strategies  will  then  coincide 
Without  distributional  assumptions  on  0  one  can  no  longer  speak 
of  the  optimality  of  a  guessing  strategy,  but  the  following  result  points 
out  a  case  \Wiere  the  second  guesaer  can  still  realize  a  substantial  advantage. 
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Three  Quarters  Theorem^.  If  X  =  X-h  and  Y  =  Y -0  are  identically 
distributed,  independent  and  symmetric  about  zero,  then  the  Hunch  Guided 
Guess  has  probability  3 A  of  winning  as  c  ■*  0. 

Proof.  As  e  -•  0  the  probability  that  the  Hunch  guided  guesser  loses 
is  P(Y  <  X  <  0)  +  P(0  <  X  <  Y).  By  symmetry  and  exchangeability  this 
probability  also  equals 


2P(0  <  Y  <  X)  =  P(0  <  X  and  0  <  Y)  =  l/U  . 

In  a  practical  application  of  the  three-quarter  theorem  the  assumption 
of  identical  distributions  might  seem  to  pose  some  difficulties.  Reassur¬ 
ingly,  the  result  is  quite  robust.  For  example,  assuming  unbiased  jointly 
normal  guesses  the  second  guesser  still  wins  with  probability  greater  than 
.68  when  Var  Y/Var  X  =  2.5  and  wins  with  probability  greater  than  .50 
when  Var  Y/Var  X  =  10.  (These  probabilities  are  easily  confirmed  by 
tables  of  the  bivariate  normal  (e.g.  Owen  [10]).)  Ihe  more  detailed 
assessment  of  robustness  in  guessing  competitions  will  be  dealt  with  in 
a  subsequent  report,  but  one  should  note  an  obvious  aspect  of  non-robust¬ 
ness  under  gross  changes  in  the  model  of  Theorem  2.1  is  that  the  probability 
of  the  second  guesser  winning  will  tend  to  l/2  or  1  according  as 
Var  Y/Var  X  tends  to  00  or  0. 

A  result  equivalent  to  the  above  was  told  to  the  first  author  in  1^75 
by  R.  Chacon  and  was  known  much  earlier  to  R.  Chacon  and  S.  Kochen.  The 
result  was  also  known  earlier  to  T.  Cover  in  the  form:  Between  two  "equally 
matched"  basketball  teams  the  odds  are  5  to  1  in  favor  of  the  team  leading 
at  the  half. 
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3.  GAUSSIAN  GUESSING 


Since  Hotelling  strategies  have  been  shown  to  be  optimal,  one  would 
naturally  like  to  provide  a  class  of  models  in  which  the  strategies  can 
be  determined  explicitly.  Hie  main  result  of  this  section  is  to  give 
such  explicit  strategies  under  a  multivariate  normal  model  studied  by 
Lindley  [81  and  Lindley  and  Smith  [ 91  - 

We  write  u|v  for  the  conditional  distribution  of  U  given  V, 

1  for  the  row  p-vector  (1,1,..., l),  and  Ip  for  the  p  x  p 
identity  matrix. 

Our  Gaussian  Model  assumptions  are  the  following: 


(3.1) 

and 


eln  ~  N(nip,  ip) 

2 

H~N(hq,  cr^) 

x,Y|e,n~  N(0(i  ,i  )  r) 


where 


The  physical  motives  behind  this  model  are  that  the  true  weights 
6  of  the  persons  we  see  are  viewed  as  independent  realizations  of  a 
single  fixed  random  process  vhich  was  itself  once  drawn  from  a  popula¬ 
tion  of  random  processes.  For  example,  the  parameter  p  can  be  viewed 
as  a  geographically  fixed  quantity  determined  at  a  earlier  time  by 
'random )  immigrations.  Hie  assumption  of  normality  one  made  partially 
out  of  traditional  convenience,  but  also  because  they  seem  justifiable 
in  the  weight  guessing  example.  Hie  structural  model  togettier  with 
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the  normality  lead  uniquely  to  the  Gaussian  Model  specified  in  (3.1.  ) 
The  promised  explicit  determination  of  the  Hotelling  strategy  is  now 

possible. 

Theorem  3.1.  Under  the  preceding  Gaussian  Model  the  Hotelling  strategy 
is 

if  <  7jio+(l-y)[PX+(l-0)Y] 

+  (l-ce)[P(Xi-X)  +  (l-e)(Yi-Y)] 

otherwise 

.  -2  v -1  2  2  2 

+  CTy  )  ,  a  =  +  per^ 

-2  -2  -2  -? 

+  cry  )  and  0  =  crx  (cr^  +  cry  ) 

The  proof  of  the  preceeding  depends  on  a  multivariate  calculation  which 
we  have  deferred  to  Appendix  I  in  order  to  take  up  directly  the  problem 
of  interpreting  the  result. 

The  basic  part  is  the  mixture  of  means,  7uq  +  (1-7)[PX+  (1-0  )Y ]  , 
which  is  perturbed  on  trial  i  by  the  ''mixture"  of  residuals 
a.0+  (lJ2)[P(Xi-X)  +  (l-P)(Yi-Y)].  The  coefficient  0  =  +  cr~2 ) 

appearing  in  these  mixtures  is  near  0,  l/2,  or  1  accordingly  as 

2  -2 

CjjCry  is  near  on,  1,  or  0,  This  ratio  is  one  natural  measure  of 
the  relative  abilities  of  the  two  guessers  and  this  interpretation  is 
reinforced  by  considering  the  extreme  cases.  Vftien  cr^cry  ~  ■  the  first 

P  «p 

guess  is  essentially  ignored  and  when  tj  cr  '  ~  o  it  is  the  hunch  which 

A  i 

gets  ignored.  This  last  case  is  of  particular  interest  since  it 
corresponds  to  trying  to  out  guess  a  far  better  informed  adversary. 


(3.2)  Gi  = 


Xj+e 


Xi-e 


-2  -2  -2 

where  y  -  cr  (u  +  ax 


„  -2.  -2  -2 
“  =  (<70  +  °x 
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k.  STEIN  GUIDED  GUESSING 


The  strategies  just  derived  have  the  drawback  that  they  are  functions 

2  2  2  2 

of  <yQ,  ax  and  Although  the  magnitude  of  and  of  the 

relevant  variance  ratios  may  be  sufficiently  understood  for  some  appli¬ 
cations,  the  exact  values  of  these  quantities  can  not  generally  be 
assumed  to  be  known.  The  next  objective  is  thus  to  derive  reasonable 
estimates  to  the  unknown  mean  and  variances.  One  benefit  of  this 
analysis  is  a  clearer  understanding  of  the  empirical  fact  [3 1  that 
the  Stein  estimator  performs  well  with  respect  to  the  reward 
function  V. 

A  Bayesian  approach  to  the  above  estimations  can  be  made  along  the 
lines  suggested  in  Lindley  and  Smith  [9]  but  such  a  procedure  can  prove 
quite  complex  (c.f.  Discussion  [9]  by  V.  D.  Bimett).  The  estimators 
considered  here  are  based  on  an  empirical  Bayes  procedure  which  seems 
both  simple  and  sensible. 

As  before  we  write  Z  =  (X,Y)  and  begin  by  transforming  Z  into  a 

T  T 

canonical  form.  Next  we  recall  that  11  =  pP  AP  where  P  denotes 

P  P 

the  p  x  p  Helmert  orthogonal  matrix  (c.f.  [2],  p.  102)  and  A  is 
the  p  x  p  matrix  with  1  in  the  (1,1)  position  and  all  other  entries 
zero.  We  define  (U,V)  by 

(U,V)  =  2’1/2(X,Y)  |Ip  Ip| 

\*P  _IP / 
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A  straight  forward,  calculation  shows  (U,V)  ~  N(p*,  E*)  where 


i*  =  (2p)1//2  iQ(e,0)  , 

e  =  (1,0, ... ,0 )  , 


r  2_ 

O',  I 

a2 1 

r  1 

0  "1 

‘a  0  * 

* 

+  P 

-  P 

2  p 

1  0 

z  = 

.  a2 1 

-  P 

a2!  - 
+  P 

i+  2pc^ 

0  J  U 

.0  0  . 

and 

2  1,2  2,  2  1,2  2  * 
ct+  =  2  (°x  °V  '  =  2  “  °y  '  * 

2 

From  the  canonical  form  above  one  notes  that  <r  cannot  be  meaningfully 
estimated  since  there  is  only  one  degree  of  freedom  available  for  its 
estimation. 

We  now  turn  to  the  analysis  of  important  special  cases  which 
correspond  to  qualitatively  different  contexts. 

Case  A.  Known  Variances.  We  only  need  to  estimate  and  this 

is  done  by  maximizing  the  Type  II  likelihood  (c.f.  Good  [5]).  This 
calculation  easily  follows  from  equation  (A.  l)  of  the  appendix,  and  the 
estimator  obtained  is 


A 


^2p  ZZZ 


r1 1T  r1 

ZZ  2p' 


This  simplifies  further  to  just 


h- 


u0  =  ex  +  (1-0  )Y  , 


so  the  estimated  Hotelling  strategy  becomes 


Xi+e  if  X±  <  a[0X+  (1-0  )Y]  +  (l-a)[0Xi  +  (1-0  )Yi] 


Xi-e 


otherwise 


where  the  parameters  a  and  0  are  as  specified  in  Theorem  3.1. 


2  2  2 

Case  B.  a  =  0}  crv  =  or„  = 

y.  X  Y 

model  simplifies  to 


2  2 

5  and  cr„  unknown.  The  canonical 

~  C7  " 


\l/2, 


o  /I  0  \  -II  0 

(U,V)  ~  N|(2p)-L/Cp  (e,0 J,  B  [  P  +  2a^  P 

U  O  I  w  0  0 


and  this  time  estimators  are  easily  found  without  carrying  out  the 

2  2-1 

likelihood  maximization.  We  take  the  estimators  of  y^,  t  =  (6  +2c7q) 

2 

and  6  ,  respectively,  given  by 

A 


an 


s0 .  (2p  rlf\ 

*  -  p<  £  Uj)'1 
i-2  1 


%2  =  p^W1  . 


In  terms  of  X  and  Y  we  then  get 
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*0 


r  *  p(  £  [—  (X.  +  Y  )  -  ~  (X  +  Y)]2) 

i-1  yfl  12 


t2  =  (p)'1  t  (X  -Y  )]2  . 
i=l  /2 


The  approximate  Hotelling  strategy  for  this  case  is  therefore 


Xi+e  if  X.  <a-  |  (X+Y )  +  (1-5)  •  |  (W 


otherwise 


where 


a  =  [  t  [—  (X  -Y  )J2][  £  (X.+V  -  -7=  (X  +  Y)}2]  . 

1-1  /2  1  1  i-1  72  1  1  ^ 


One  should  note  that  a  has  a  natural  interpretation.  It  is  just 

the  ratio  (between  the  guesser  variance)  f  (between  the  trial  variance). 

The  optimal  strategy  favors  using  ^(X+Y)  when  ct  is  close  to  1 
1  A 

and  favors  — (X^+Y^)  w^en  cc  is  close  to  0.  Also,  the  strategy  can 

A 

be  improved  slightly  by  replacing  a  by  1  if  it  happens  that  «  >  1. 

2  2  2  2 

Case  C.  cr^  =  0,  Oy  =  ™,  cr^  Known;  Unknown. 

This  is  a  case  we  feel  to  be  of  particular  interest.  Calculating 
as  before  we  find  that  a  Stein  estimator  determines  the  approximate 
Hotelling  strategy,  but  that  it  plays  a  cameo  role  since  the  strategy 


w 


simplifies  to  just  "betting  on  the  side  of  X."  This  simple  result 
gives  some  theoretical  justification  to  the  otherwise  somewhat  mysterious 
empirical  fact  that  X  performs  even  better  than  the  Stein  estimator  in 
terms  of  "gambler's"  loss  on  the  batting  average  data  set  (c.f.  Plackett’s 
Comment  [3,  P-  ^16 ]  and  an  easy  computation). 

2 

The  formal  analysis  begins  as  in  Case  B.  Since  a^.  =  oo  the  Y 

is  uninformative  and  the  analysis  must  rest  on  X  above.  Also  since 

2 

tr  =  0  the  canonical  form  of  the  X1  s  marginal  model  can  be  simplified 
to 


U*  ~  N(pl/2,0e  .  (ajJ  +  O^Hp) 
The  obvious  estimate  of  is  given  by 


M0-x 


2  .  _2 


-1 


and  if  we  require  that  the  estimate,  T„,  of  t„  ■  (ov+oQ)  to  be  unbiased, 

A  A  A  Q 

-1 

,  -  \(  V  (v  _Y  \c 

X 


A 


=  (p-3)(  t  (X.-X)2)  1  , 
i=l  1 


becomes  the  natural  choice  and  the  estimated  Hotelling  strategy  is 


Ci  +  e  if  Xj_  <a*X  +  (1-6*^ 


Gi  - 


ve 


otherwise 


where  a*  =  oin(l, a)  and  a  =  ct2(p-3)[  £  (X. -X) 

A  1=1  1 


-?  “1 
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The  direction  of  the  guess  on  the  side  of  is  determined  by 

a*X  +  (l-a*)x^  which  is  precisely  the  Stein  estimator  as  modified 
by  Lindley  (c.f.  [7]  and  the  discussion  following  [9])* 

Now  since  the  convex  combination  of  X  and  will  always  be  on 

the  same  side  of  X.  as  X  the  estimated  Hotelling  strategy  can  be 
more  simply  written  as  just 


(4.1) 


if  Xi  <  X 

otherwise 


This  is  an  extraordinarily  simple  procedure  in  a  model  which  we 
feel  may  be  realistic  in  several  sporting  and  business  contexts. 

To  assess  the  performance  of  this  Stein-guided  strategy  guessing 
trials  were  simulated  for  a  variety  of  special  cases.  The  were 

chosen  as  0^=0,  6^  =  i,  and  0^  =  1?  for  each  of  i  =  l,2,...,p 
with  p  =  10  and  then  with  p  =  lOOj  thus,  in  all  3x2=6  cases 
were  considered. 

As  an  illustration  of  the  computation  consider  the  case  where 
0^  =  i  and  p  =  100.  In  this  case  200  repetitions  were  made  as 
follows : 

(1)  X  was  generated  as  N(e,I)  with  0  =  (1,2,. . . ,100). 

(2)  was  calculated  by  (4.1)  with  €  =  lO-^. 

(3)  V  was  then  calculated,  and  the  process  was  repeated  200  times. 

(4)  The  200  realizations  of  v/100  were  used  to  estimate  the  density 
of  V/lOO,  the  percentage  of  times  the  second  player  wins  using  the  G 
of  (4.1). 
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(5)  This  density  was  plotted  in  Figure  2  (in  this  case,  the 
unshaded  density  in  the  middle  graph ). 

From  Figure  A  one  learns  by  looking  at  the  unshaded  density  in 
the  top  graph  that  when  as  many  as  10  parameters  growing  like  i 
are  to  be  guessed  that  the  modal  percent  of  correct  guesses  made  by 
the  second  guesser  is  about  95  percent.  The  general  conclusion  to  be 
drawn  from  Figure  A  are  (1)  the  more  parameters  to  be  guessed,  the 
greater  the  advantage  to  the  second  guesser  and  (2)  the  more  spread 
out  the  0^  to  be  guessed,  the  more  the  advantage  to  the  second 
guesser. 

In  Figure  B,  these  conclusions  are  further  examined  by  taking  the 
0^  themselves  to  be  random.  Here  p  =  20  was  fixed  throughout.  First 
we  took  a  realization  of  0  ~  N(0,  4l2Q).  Then  POO  of  the  X's  we 
generated  with  the  same  fixed  underlying  P  (just  as  in  Figure  A). 

The  density  of  v/200  was  estimated  as  before,  and  altogether  25 
runs  were  made.  The  25  runs  produced  remarkably  similar  estimates  of 
the  density  of  V/200,  and  the  estimates  from  runs  number  1,  17,  and 
21  were  selected  as  indicative  of  the  variability  in  the  25  runs. 
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x  *  proportion  of  p  trials  won  by  the  second  guesser 


Estimated  Density  of  the  Proportion  of  Second  Guesser  Wins 
Using  the  Stein-Guided  Optimal  Strategy  (Based  on  Runs  of 
200  Contests ). 


Estimated  density 


0  .5  1.0 


*  s  proportion  of  p  *  20  triols  won  by  the  second  guesser 


3.  Estimated  Density  of  the  Proportion  of  Second  Guesser  Wins  in 
p  =  20  Trials  Using  the  Stein-Guided  Optimal  Strategy  (3  Runs 
of  1<X)  Contests  and  Combined  Runs  of  2500  Contests). 


REALITIES  OF  APPLICATION 


Almost  any  discussion  of  the  precetding  theory  eventually  turns  to 
the  problem  of  football  betting,  and  it  seems  generally  worthwhile  t r 
note  the  theory  is  not  applicable  to  that  problem.  A  key  reason 
is  that  the  bookie  (or  person  "setting  the  line")  is  not  trying  to 
estimate  the  actual  point  spread.  He  is  trying  to  produce  a  point 
spread  which  will  produce  a  nearly  equal  number  of  takers  on  ea-'h  side 
of  the  spread.  The  bookie  is  therefore  not  a  "first  guesser"  in  th' 
sense  of  this  paper  and  our  theory  naturally  does  not  apply. 

Consider  instead  two  bookies  of  equal  caliber,  one  of  whom  sets 
his  line  on  Monday  and  the  other  on  Tuesday  (for  the  game  on  f>  unday). 

If  the  Tuesday  bookie  only  wished  to  obtain  a  more  even  distribution 
of  customers  on  either  side  of  his  line  than  the  Monday  bookie,  he 
should  be  able  to  do  so  in  almost  3/4  of  the  games  by  using  the  Hunch 
guided  guessing  of  Section  2,  In  this  case,  each  bookie  is  a  bona  fi d< 
guesser  of  that  spread  s  which  will  evenly  split  the  pel  of  betters. 

Since  there  are  actually  many  games  each  week,  the  Tuesday  bookie 
could  actually  out  perform  the  Monday  bookie  by  using  the  tein  guided 
strategy  of  Section  4,  particularly  (4.1  ).  The  assumptions  of  (■'.!)  may 
not  be  applicable  to  the  ho] <  set  of  games;  but  if  one  considers  only 
non-carismatic  games  outside  the  bookie's  city,  then  (3.:)  s-ems 
reasonable.  (This  is  a  stratification  step  to  obtain  increased  homogeneity 
of  the  spreads  to  be  guessed. ) 

The  examples  put.  forward  above  are  in  the  long  tradition  of  gedanken 
experiaente ,  and  the  problem  of  producing  a  truly  telling  application  remairu 
open.  An  intriguing  aspect  of  ■>  theory  of  this  nature  is  that  it  is  only 
necessary  to  lind  one  good  application. 
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Appendix  I:  Proof  of  Theorem  5.1 

Qy  Theorem  2.1  the  problem  depends  on  the  calculation  of  the  posterior 
median  v^(X,Y)  which  by  the  normality  assumptions  (5.1)  coincides  with 
the  posterior  mean.  The  argument  given  here  for  completeness  is  similar 
to  those  of  Lindley  [7 1  and  Lindley  and  Smith  ['•>'.  It  depends  on  the 
well  known  fact  (c.f.  (1),  p.  27)  that  if  U  and  V  are  jointly  normal 
then 


U|V  ~  N(EU  -  EV  •  C  +  >^#v) 

where  ;  =  7^  -  ym'rW  4  vu'  "w  denotes  ***  covariance 

matrix  of  V  and  so  on. 

Setting  Z  =  (X,Y)  and  applying  the  above  to  (5.1)  we  have 
(A.1)  2~><WV'  !2Z> 


where 


7,7. 


where  <JJ>  denotes  the  Kronecker  matrix  product  and  is  the  matrix 

with  all  1  entries.  Further, 

E(«#!z)  -  E  )  -  E(Z)  •  '  «  7,  •  r. 


-1 

7,7.  '  Zf 


r- 


where 


2  2  T 

r  =  (tZI  +  all  . 
ee  e  p  \i  p  p 


We  now  determine  T.^  .  First  we  note  that 


T  T 

11  =  p  Px  A  P 

P  P 


where  P  denotes  the  Helmert  orthogonal  matrix  (c.f.  [2],  p.  102)  and 
A  is  the  p  x  p  matrix  with  1  in  the  (1,1)  position  and  all 


remaining  entries  zero. 


One  then  notes  that 


d2®p)  yi^p1) 


reduces  to  just 


diagtcr^cTy}  ®  Ip  +  :p  +  ® A 


which  makes  it  straightforward  to  show 


Ezz  =  (I?®pT)  8  (T2®p) 


where 


lax 

0  \ 

17  0  | 

I1  _1\  / 

(c 

-2 
°Y  1 

|® 

o  ai  J 

p-l' 

+  u  M 

-2-2  2 


-2  -2  2 
a  a  a  al 
x  y  0  p-l 


) 


where  y  and  a  are  as  in  Theorem  3.1. 

From  these  results  an  explicit  expression  for  2  •  C  is  readily 
obtained.  We  first  note 

fh  rn  fp  mi 

z-  K,  =  (XP^YP1)  ©  (p  ,PA)  E00 

and 

e(PT,  PT)  =  (a"2,a‘?)  <S> 

Thus 


(A.2)  Z-C  =  (XPT,YPT)[(a“2,crY2)  ®  q  ^  jl 

i  p-l ' 


l(a0Tp+PCruA)P  =  (aX2x  +  CTY2Y)A 


where 


a27  0 

0  4azp.i 


p 


Now  represent  X  as 


Then  we  have 


X  =  XI  +  (X-Xl  )  . 
p  p 

XPT  =  X(p1/2,0,...,0)  4-  (0,X(2)) 
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v/here 


(h'h 

\ 


X1+Xg+-  •  •-t-Xp_1-(P-i  )x^. 

4  (i-p)p 


XA  =  cr27  X(pl/2/),....,0)P  +  a2a(0,X(2))P 


=  a2yXl  +  a2a(X-XL  ) 
P  0  P 


A  similar  result  holds  for  YA.  Introducing  the  resulting  expressions 


for  XA  and  YA  in  (A.  2)  yields 


(A.3)  (a~2XKT~?Y)A  -  (1-?)[PX+  (l-^)Y]lp+(l^)[P(X-Xlp)+(l-P)(Y-Ylp)) 


2-2-2  2  -2  ~P 

since  yo  (a  +crv  )  =  1-7  and  a<T  (crv  +cr  ')  =  1-3. 

Ax  U  A  I 

From  the  expression  just  obtained  in  (A.3)  E(Z)*£  and  hence 

E(o)-E(Z)-£  are  easily  found.  To  get  E(Z)*£  simply  substitute  p^l^ 
for  both  X  and  Y.  This  gives 


E(ZK  =  * 


At  the  same  time  E(o)  =  p^l  ,  so  we  have 


(A.4) 


E(e)-E(zK  =  7^1 


The  proof  of  Theorem  3.1  is  now  an  immediate  consequence  of  (A.3) 


and  (A.4 ). 
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